Situation based speech recognition for structuring baseball live games

نویسندگان

  • Atsushi Sako
  • Tetsuya Takiguchi
  • Yasuo Ariki
چکیده

It is a difficult problem to recognize baseball live speech because the speech is rather fast, noisy, emotional and disfluent due to rephrasing, repetition, mistake and grammatical deviation caused by spontaneous speaking style. To solve these problems, we have been studied the speech recognition method incorporating the baseball game task-dependent knowledge as well as an announcer’s emotion in commentary speech [1]. In addition, in this paper, we propose the situation prediction model based on word co-occurrence. Owing to these proposed models, speech recognition errors are effectively prevented. This method is formalized in the framework of probability theory and implemented in the conventional speech decoding (Viterbi) algorithm. The experimental results showed that the proposed approach improved the structuring and segmentation accuracy as well as keywords accuracy.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Structuring of Baseball Live Games Based on Speech Recognition Using Task Dependent Knowledge

It is a difficult problem to recognize baseball live speech because the speech is rather fast, noisy and disfluent due to rephrasing, repetition, mistake and grammatical deviation caused by spontaneous speaking style. To solve these problems, we propose in this paper a speech recognition method of incorporating the baseball game knowledge such as counting of inning, out, strike and ball. Due to...

متن کامل

Structuring of baseball live games based on speech recognition using task dependant knowledge

It is a difficult problem to recognize baseball live speech because the speech is rather fast, noisy and disfluent due to rephrasing, repetition, mistake and grammatical deviation caused by spontaneous speaking style. To solve these problems, we propose in this paper a speech recognition method of incorporating the baseball game knowledge such as counting of inning, out, strike and ball. Due to...

متن کامل

Real-Time Closed-Captioning Using Speech Recognition

There is a great need for more TV programs to be closed-captioned to help hearing impaired and elderly people watch TV. For that purpose, automatic speech recognition is expected to contribute to providing text from speech in real-time. NHK has been using speech recognition for closed-captioning of some of its news, sports and other live TV programs. In news programs, automatic speech recogniti...

متن کامل

Live speech recognition in sports games by adaptation of acoustic model and language model

This paper proposes a method to automatically extract keywords from baseball radio speech through LVCSR for highlight scene retrieval. For robust recognition, we employed acoustic and language model adaptation. In acoustic model adaptation, supervised and unsupervised adaptations were carried out using MLLR+MAP. By this two level adaptation, word accuracy was improved by 28%. In language model ...

متن کامل

Robust Scene Recognition for Baseball Broadcast

This paper introduces a statistical framework for recognizing scenes from a baseball broadcast video. Inspired by the successes of statistical approaches in speech recognition field, we propose a data-driven approach to provide robust scene recognition. We use several global features and apply multi-stream Hidden Markov Models (HMMs) to control the weights among them. To achieve robustness agai...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005